In order to profit from the high granularity of the calorimeters proposed for the ILC 
that are suitable for the Particle Flow Approach, specialised clustering algorithms have 
to be developed. GARLIC is such an algorithm; its goal is to find and identify 
pointing photons in the electromagnetic calorimeter. This would help to improve the 
energy resolution of the photon contribution in jets. 

1 Introduction: The GARLIC algorithm 

The GARLIC algorithm is based on the REPLIC package. Its main idea is to make use of 
the typical shape of an electromagnetic shower. Because such a shower is dense, a simple neighbour 
criterion is sufficient for clustering. In this way, noise hits as well as fluctuations in the 
extremities of the shower are suppressed and at the same time the two-particle separation is 
optimised. GARLIC has been implemented in the MARLIN framework ([5]) and is adapted 
to the Si/W ECAL, both for the CALICE prototype and for the full detector model 
(LDC/ILD). To account for the differences in geometry, two versions of the algorithm exist. 
They are identical in terms of the actual clustering but adapted to the respective geometry. 
The main steps in the algorithm are: 

• Recursive pre-clustering based on the distance between hits 

• Seed search via two-dimensional projection of the energy deposit in the first 7 X0 

• Core building along direction found by the seed search 

• Iterative clustering based on neighbour criterion, proceeding from front to back 

• Rejection via simple criteria (number of hits, minimum energy, seed criteria) 

• Correction for energy losses in wafer guard rings and inter-module gaps 

• Rejection with neural network output based on cluster variables such as eccentricity, 
width, direction, energy deposit in different parts of the cluster, etc. 
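
The first step in the list above, pre-clustering based on the distance between hits, can be illustrated with a minimal sketch. This is not the GARLIC implementation: the function name, the representation of hits as (x, y, z) tuples and the threshold `max_dist` are assumptions made for illustration, and the recursion is replaced here by an equivalent breadth-first loop.

```python
from collections import deque
import math


def precluster(hits, max_dist):
    """Group hit indices into connected sets.

    Two hits are linked if their Euclidean distance is at most max_dist
    (a hypothetical threshold, not a value taken from the text).
    """
    unvisited = set(range(len(hits)))
    clusters = []
    while unvisited:
        start = unvisited.pop()
        queue = deque([start])
        members = [start]
        while queue:
            i = queue.popleft()
            # Collect all not-yet-assigned hits close to hit i.
            near = [j for j in unvisited
                    if math.dist(hits[i], hits[j]) <= max_dist]
            for j in near:
                unvisited.remove(j)
                queue.append(j)
                members.append(j)
        clusters.append(sorted(members))
    return sorted(clusters)


# Three hits in a chain and one isolated hit (arbitrary units).
example_hits = [(0, 0, 0), (1, 0, 0), (2, 0, 0), (10, 0, 0)]
example_clusters = precluster(example_hits, max_dist=1.5)
```

Hits 0, 1 and 2 end up in one pre-cluster because each is within the threshold of its neighbour, even though hits 0 and 2 are not directly within reach of each other.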

2 Characterisation of the prototype version 

The CALICE Si/W ECAL prototype has been tested in several test beams. GARLIC can 
be applied to data taken with electron beams to validate the clustering of electromagnetic 
showers. The basic concept of the algorithm works well. At a beam energy of 10 GeV at 
normal incidence, the mean fraction of clustered hits per event is about 97%, while the mean 
fraction of clustered energy is above 99%. This means that the hits removed from the event 
are mainly low-energy hits in the outer parts of the shower, which represent fluctuations 
or noise. 
2.1 Event cleaning 



Another important aspect of GARLIC is its capability to clean event samples in the prototype 
test beam. To reject pion background, a simple variable can be defined that measures 
the ratio of the energy deposit in the core of a shower to the total energy of the cluster. 
The core is hereby defined as all the hits with a distance of less than \pl x cellsize to the 
main axis of the cluster. This method proves as efficient as the rejection with a Cherenkov 
counter. Since this detector was not available during all the data-taking periods, GARLIC 
can be used to clean the event samples in these cases. Combining the two methods gives 
even better results. A second application for event cleaning is to require that exactly one 
cluster has been found in the event. This will reject multi-particle events as well as those 
where the incoming particle has started its shower upstream of the ECAL. 
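
The core-energy ratio used for pion rejection can be sketched as follows. This is only an illustration under stated assumptions: hits are taken as (x, y, z, energy) tuples, the cluster axis is given by a point and a direction, and the core radius is passed as a free parameter `core_radius` because the text defines it only as a multiple of the cell size. The function name is hypothetical.

```python
import math


def core_energy_fraction(hits, axis_point, axis_dir, core_radius):
    """Return E_core / E_total for one cluster.

    hits        : iterable of (x, y, z, energy)
    axis_point  : a point on the main cluster axis
    axis_dir    : direction of the main cluster axis (any non-zero length)
    core_radius : maximum distance to the axis for a core hit (assumed input)
    """
    norm = math.sqrt(sum(c * c for c in axis_dir))
    unit = [c / norm for c in axis_dir]
    e_core = 0.0
    e_total = 0.0
    for x, y, z, energy in hits:
        d = (x - axis_point[0], y - axis_point[1], z - axis_point[2])
        along = sum(a * b for a, b in zip(d, unit))
        # Squared perpendicular distance from the hit to the axis.
        perp_sq = sum(a * a for a in d) - along * along
        e_total += energy
        if math.sqrt(max(perp_sq, 0.0)) < core_radius:
            e_core += energy
    return e_core / e_total if e_total > 0 else 0.0


# Two hits near the axis and one far away (arbitrary units).
example_fraction = core_energy_fraction(
    [(0, 0, 0, 8.0), (0.5, 0, 1, 1.0), (3, 0, 2, 1.0)],
    axis_point=(0, 0, 0), axis_dir=(0, 0, 1), core_radius=1.0)
```

A compact electromagnetic shower gives a ratio close to one, while a broader hadronic shower gives a smaller value, so a cut on this ratio separates the two. The cut value itself is not given in the text.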

2.2 Gap correction 

The Si/W ECAL shows some inefficient areas due to the presence of guard rings around the 
silicon wafers and the carbon fiber structure between modules. Detailed information can be 
found in [5]. GARLIC's built-in gap correction has been kept very simple in order 
to keep it independent of the angle of incidence. So-called ghost hits are introduced in a 
gap between two hits. Each ghost hit is assigned an energy deposit equal to the mean of the 
adjacent real hits, scaled by the ratio of the passive area to the area of one cell. By 
applying this method, losses can be reduced by about 5% for events in the inter-wafer gaps 
and by about 8% at the crossing of two such gaps. 
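
The ghost-hit energy described above can be written as a one-line calculation. The sketch below is illustrative only: the function and parameter names are assumptions, and the numbers in the example are invented, not detector values.

```python
def ghost_hit_energy(e_first, e_second, passive_area, cell_area):
    """Energy assigned to a ghost hit placed in a gap between two real hits.

    The mean of the two adjacent real hits is scaled by the ratio of the
    passive (insensitive) area to the area of one cell.
    """
    mean_energy = 0.5 * (e_first + e_second)
    return mean_energy * passive_area / cell_area


# Invented numbers: neighbours of 2 and 4 MIP, passive area a quarter of a cell.
example_ghost = ghost_hit_energy(2.0, 4.0, passive_area=0.25, cell_area=1.0)
```

Because only the two neighbouring hits and fixed areas enter, the correction does not depend on the angle of incidence, which matches the design goal stated in the text.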

2.3 Impact on linearity and resolution 

It turns out that the clustering has only small effects on the linearity and resolution of the 
detector. As mentioned above more than 99% of the recorded energy is clustered. The main 
impact is then due to the gap correction that artificially increases the detector response at 
higher energies. To be consistent with the LDC/ILD version, where the conversion from MIP 
to GeV has to be valid over a very wide energy range and induces a small non-linearity, 
the response is fitted with a parabolic function. As expected, the second-order term turns 
out to be very small. The impact on the energy resolution is also small. The stochastic term 
s in the parametrisation $\sigma_E/E = s/\sqrt{E} \oplus c$ changes by about 0.1% (absolute), 
and the two values agree within the errors. 
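
For reference, the resolution parametrisation can be evaluated numerically. The sketch assumes the usual calorimeter form, in which the stochastic term s/sqrt(E) and the constant term c are added in quadrature. The function name and the values of s and c in the example are placeholders, not results from this study.

```python
import math


def relative_resolution(energy_gev, s, c):
    """sigma_E / E for a stochastic term s and a constant term c.

    The two terms are added in quadrature:
    sqrt((s / sqrt(E))**2 + c**2).
    """
    return math.hypot(s / math.sqrt(energy_gev), c)


# Placeholder values, chosen only so that the arithmetic is easy to follow:
# s / sqrt(100) = 0.03, combined in quadrature with c = 0.04.
example_resolution = relative_resolution(100.0, s=0.3, c=0.04)
```

At low energies the stochastic term dominates, which is why a change in s is the natural quantity to quote when comparing clustered and unclustered data.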

3 Characterisation of the LDC/ILD version 

The LDC/ILD version has more features than the prototype version, since it has to 
be adapted to a complete detector geometry and to more complex event types. 

3.1 Rejection of clusters from charged hadrons 

Separating clusters of electromagnetic showers from those created artificially, especially by 
charged hadrons that started interacting in the ECAL, needs several steps. The first and 
simplest one is to reduce the number of seeds that could be found from interacting charged 
hadrons. Therefore, tracks from charged particles are extrapolated into the ECAL volume 
and hits close by are removed from the pool of hits used for seed finding. Since only hits 
in the first 7 X0 are taken into account, only early showering hadrons will generate seeds. 
With 10 GeV hadrons, a seed is found in only 13% of the events. To remove the residual 
artificial clusters, a neural network has been trained with several variables describing the 
cluster properties. 
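
The removal of hits close to extrapolated tracks before seed finding can be sketched as below. The representation is an assumption made for illustration: each extrapolated track is reduced to a list of sampled points inside the ECAL, and `veto_dist` is a hypothetical distance cut whose value is not given in the text.

```python
import math


def hits_for_seeding(hits, track_points, veto_dist):
    """Return the hits that stay in the pool used for seed finding.

    hits         : list of (x, y, z) hit positions
    track_points : list of (x, y, z) points sampled along extrapolated tracks
    veto_dist    : hits within this distance of any track point are dropped
    """
    return [h for h in hits
            if all(math.dist(h, p) > veto_dist for p in track_points)]


# One hit next to the extrapolated track and one far from it.
example_pool = hits_for_seeding(
    [(0, 0, 0), (5, 0, 0)], track_points=[(0.5, 0, 0)], veto_dist=1.0)
```

Only the hit far from the track survives, so a seed can no longer be built from the energy deposited along the charged particle's path.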
Figure 1: Clustering efficiency for single-γ events (red dots) and percentage of events with 
artificial clusters from showers of single π's (black squares) as a function of the energy of the 
simulated γ and of the energy of the artificial cluster, respectively. 



3.2 Clustering efficiency and purity 



Simulated single-γ (0.2-150 GeV) and single-π (1-110 GeV) events have been used to study 
GARLIC's clustering efficiency and the residual fake clusters from hadron showers. In both 
cases the particle is required to enter the ECAL volume and not to interact in the tracker 
region. Figure 1 summarizes these results. The dashed blue vertical lines indicate the cluster 
energies that are grouped for the neural network decision. The red dots give the fraction 
of events with at least one cluster found, as a function of the γ energy in single-photon 
events. The lower energy limit for accepted clusters is 150 MeV. At 500 MeV the efficiency 
to find a cluster is already 96%, at 1 GeV it is 98.6%, and it exceeds 99% for energies 
above 1.5 GeV. The black squares give the percentage of single-π events that show a 
fake cluster of a given energy, averaged over all simulated π energies. One can see that 
it is very unlikely to create fake clusters above 1 GeV. At lower energies, however, there 
are only a few hits available to characterize the cluster, so selecting a fake cluster is more 
likely. Interactions in the tracker region play a very important role. Rejection of 
the resulting fragments, which do not have any tracking information, proves nearly impossible. 
When such interactions in the tracker region are allowed, the background level more than doubles. 
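
The efficiency quoted above is the fraction of events with at least one cluster found. A minimal sketch of that calculation follows; the function name is hypothetical, and the simple binomial uncertainty is an added assumption that the text does not specify.

```python
import math


def cluster_efficiency(clusters_per_event):
    """Fraction of events with at least one cluster, with a binomial error.

    clusters_per_event : list with the number of clusters found in each event
    Returns (efficiency, uncertainty).
    """
    n_events = len(clusters_per_event)
    if n_events == 0:
        raise ValueError("need at least one event")
    n_found = sum(1 for n in clusters_per_event if n >= 1)
    eff = n_found / n_events
    # Simple binomial estimate; it becomes unreliable very close to 0 or 1.
    err = math.sqrt(eff * (1.0 - eff) / n_events)
    return eff, err


# Four invented events, one of them without any cluster.
example_eff, example_err = cluster_efficiency([1, 0, 2, 1])
```

The same function applied to single-π samples, counting events with at least one surviving cluster, would give the fake-cluster rate instead of the efficiency.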
Figure 2: Total energy deposit from photons in 
two-jet events at E_cm = 200 GeV. The definition 
of visible photons can be found in the text. 
Figure 3: Correlation between the 
number of clusters found by 
GARLIC and simulated photons 
in two-jet events at E_cm = 
200 GeV. 



3.3 Application on jets 

About 30% of the energy in jets comes from photons. One of the goals of GARLIC is 
to optimise the resolution of this part of the jet energy and to isolate the photon clusters 
so that subsequent clustering algorithms can complete the particle flow by treating the charged 
particles and neutral hadrons. Two-jet events in the ILD detector at E_cm = 200 GeV have 
been simulated and reconstructed with GARLIC. Figure 2 shows the sum of the energy 
deposited by all photons in the two jets. Photons are defined as visible (for GARLIC) 
if they have a minimum energy (E_γ > 150 MeV) and if they enter the ECAL volume 
(|cos θ| < 0.9774). One can see that the photon energy is quite well reconstructed. The 
correlation between the number of photon clusters and simulated photons as specified above 
is given in Figure 3. As one can see, the number of photons in the jet is often overestimated. 
The combination of these two plots suggests that there is still a significant background of 
artificial clusters at very low energies. This is amplified by the creation of fake clusters 
due to hadron interactions, as described in the treatment of single-particle events. 
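
The definition of a visible photon used in this section can be expressed as a simple selection. The thresholds of 150 MeV and |cos θ| < 0.9774 are those quoted in the text; the function name and argument layout are assumptions made for illustration.

```python
def is_visible_photon(energy_gev, cos_theta,
                      e_min_gev=0.150, max_abs_cos=0.9774):
    """True if a simulated photon counts as visible for GARLIC.

    The photon must exceed the minimum energy and enter the ECAL volume,
    expressed here as a cut on |cos(theta)|.
    """
    return energy_gev > e_min_gev and abs(cos_theta) < max_abs_cos


# A central 1 GeV photon, a soft photon, and a very forward photon.
example_flags = [
    is_visible_photon(1.0, 0.5),
    is_visible_photon(0.1, 0.5),
    is_visible_photon(1.0, 0.99),
]
```

Only photons passing this selection enter the comparison between the number of simulated photons and the number of reconstructed clusters.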

4 Conclusions 

The study on test beam data shows that GARLIC does not have any negative influence on 
the energy resolution. The built-in gap correction helps to reduce the effect of energy losses 
in non-sensitive areas of the calorimeter. To optimise this correction, different approaches 
are being studied. GARLIC works for photons down to 150 MeV with high efficiencies and 
low background. Nevertheless, hadron interaction in the tracker region degrades its (and the 
particle flow) performance. 